AITopics | global solution

Stochastic gradient descent (SGD) algorithm is the method of choice in many machine learning tasks thanks to its scalability and efficiency in dealing with large-scale problems. In this paper, we focus on the shuffling version of SGD which matches the mainstream practical heuristics. We show the convergence to a global solution of shuffling SGD for a class of non-convex functions under over-parameterized settings.

artificial intelligence, convergence, machine learning, (17 more...)

Neural Information Processing Systems

Country:

North America > United States > California (0.04)
North America > United States > New York > Tompkins County > Ithaca (0.04)
North America > United States > Massachusetts > Suffolk County > Boston (0.04)
(7 more...)

Genre: Research Report (0.68)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.88)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Add feedback

cdce17de141c9fba3bdf175a0b721941-Paper-Conference.pdf

Neural Information Processing SystemsFeb-11-2026, 23:29:07 GMT

arxiv preprint arxiv, classifier, loss function, (13 more...)

Neural Information Processing Systems

Country:

North America > United States > Ohio (0.04)
North America > United States > Michigan (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report > New Finding (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.49)

Add feedback

incorporating all the comments

Neural Information Processing SystemsFeb-11-2026, 08:17:27 GMT

We will add this in the revision.

artificial intelligence, machine learning, mfg, (16 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.54)

Add feedback

6c1e55ec7c43dc51a37472ddcbd756fb-Paper.pdf

Neural Information Processing SystemsFeb-8-2026, 19:14:58 GMT

algorithm, data provider, learner, (16 more...)

Neural Information Processing Systems

Country:

North America > United States (0.05)
Europe > United Kingdom > England > Hampshire > Southampton (0.04)
North America > Canada (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Industry: Education (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Computational Learning Theory (0.82)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.68)

Add feedback

30d411fdc0e6daf092a74354094359bb-Supplemental.pdf

Neural Information Processing SystemsFeb-8-2026, 02:56:35 GMT

global solution, operator, pe solution, (15 more...)

Neural Information Processing Systems

Country: North America > United States > California > San Diego County > San Diego (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

30d411fdc0e6daf092a74354094359bb-Paper.pdf

Neural Information Processing SystemsFeb-8-2026, 02:56:32 GMT

global solution, pe solution, program synthesis, (15 more...)

Neural Information Processing Systems

Country:

North America > Canada > Quebec > Montreal (0.14)
North America > United States > Louisiana > Orleans Parish > New Orleans (0.04)
North America > United States > California > Santa Clara County > San Jose (0.04)

Genre: Research Report (0.68)

Technology:

Information Technology > Software (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.47)
Information Technology > Artificial Intelligence > Representation & Reasoning > Logic & Formal Reasoning (0.43)

Add feedback

On the Convergence to a Global Solution of Shuffling-Type Gradient Algorithms

Neural Information Processing SystemsDec-27-2025, 03:50:43 GMT

Stochastic gradient descent (SGD) algorithm is the method of choice in many machine learning tasks thanks to its scalability and efficiency in dealing with large-scale problems. In this paper, we focus on the shuffling version of SGD which matches the mainstream practical heuristics. We show the convergence to a global solution of shuffling SGD for a class of non-convex functions under over-parameterized settings. Our analysis employs more relaxed non-convex assumptions than previous literature. Nevertheless, we maintain the desired computational complexity as shuffling SGD has achieved in the general convex setting.

convergence, global solution, shuffling-type gradient algorithm, (4 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.63)

Add feedback

Global Solutions to Non-Convex Functional Constrained Problems with Hidden Convexity

Fatkhullin, Ilyas, He, Niao, Lan, Guanghui, Wolf, Florian

arXiv.org Artificial IntelligenceNov-14-2025

Constrained non-convex optimization is fundamentally challenging, as global solutions are generally intractable and constraint qualifications may not hold. However, in many applications, including safe policy optimization in control and reinforcement learning, such problems possess hidden convexity, meaning they can be reformulated as convex programs via a nonlinear invertible transformation. Typically such transformations are implicit or unknown, making the direct link with the convex program impossible. On the other hand, (sub-)gradients with respect to the original variables are often accessible or can be easily estimated, which motivates algorithms that operate directly in the original (non-convex) problem space using standard (sub-)gradient oracles. In this work, we develop the first algorithms to provably solve such non-convex problems to global minima. First, using a modified inexact proximal point method, we establish global last-iterate convergence guarantees with $\widetilde{\mathcal{O}}(\varepsilon^{-3})$ oracle complexity in non-smooth setting. For smooth problems, we propose a new bundle-level type method based on linearly constrained quadratic subproblems, improving the oracle complexity to $\widetilde{\mathcal{O}}(\varepsilon^{-1})$. Surprisingly, despite non-convexity, our methodology does not require any constraint qualifications, can handle hidden convex equality constraints, and achieves complexities matching those for solving unconstrained hidden convex optimization.

artificial intelligence, cit, machine learning, (18 more...)

arXiv.org Artificial Intelligence

2511.10626

Country:

Europe > Switzerland > Zürich > Zürich (0.14)
North America > United States > Texas > Travis County > Austin (0.04)
North America > United States > New York > Monroe County > Rochester (0.04)
(4 more...)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.67)

Add feedback

Collaborating Authors

global solution

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

4b3cc0d1c897ebcf71aca92a4a26ac83-Supplemental-Conference.pdf

On the Convergence to a Global Solution of Shuffling-Type Gradient Algorithms Anonymous Author(s) Affiliation Address email

On the Convergence to a Global Solution of Shuffling-Type Gradient Algorithms Lam M. Nguyen

cdce17de141c9fba3bdf175a0b721941-Paper-Conference.pdf

incorporating all the comments

6c1e55ec7c43dc51a37472ddcbd756fb-Paper.pdf

30d411fdc0e6daf092a74354094359bb-Supplemental.pdf

30d411fdc0e6daf092a74354094359bb-Paper.pdf

On the Convergence to a Global Solution of Shuffling-Type Gradient Algorithms

Global Solutions to Non-Convex Functional Constrained Problems with Hidden Convexity